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® Probe groups for the detection of nucleotide variations and genetic polymorphisms. 

® The invention provides a group of oligonucleotide probes in which the individual probes are capable of 
hybridizing to specific gene sequence variations so as to permit detection of said variations when said probes 
are labelled and so-hybridized, whereby allelic variations and gene polymorphism in samples may be detected 
using said group of probes. Such probe groups are useful for detecting nucleotide variations, mutations and 
polymorphisms by hybridization to amplified nucleic acid sequence containing such variations, mutations or 
polymorphisms. 
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This invention relates to probe groups for the detection of nucleotide variations and genetic polymor- 
phisms which may be used in processes for detecting nucleotide variations, mutations and polymorphisms 
by amplifying nucleic acid sequences suspected of containing such mutations or polymorphisms and 
detecting them in a dot blot format. 
5 In recent years, the molecular basis of a number of human genetic diseases has been elucidated by the 
application of recombinant DNA technology. In particular, the detection of specific polymorphic restriction 
sites in human genomic DNA associated with genetic disease, such as sickle-cell anemia, has provided 
clinically valuable information for prenatal diagnosis. In these studies, the presence or absence of a specific 
site is revealed by restriction fragment length polymorphism (RFLP) analysis, a method in which variation in 

w the size of a specific genomic restriction fragment is detected by Southern blotting and hybridization of the 
immobilized genomic DNA with a labeled probe. RFLP analysis has proved useful in the direct detection of 
polymorphic sites that contain the mutation conferring the disease phenotype (e.g., Mstll and sickle-cell 
anemia) as well as in linkage studies where a particular allelic restriction site is IinkedTo a disease locus 
within a family but not necessarily in the general population. See, for example, Kan and Dozy, PNAS (USA), 

75 75, 5631 (1978), and Rubin and Kan, Lancet , 1985-1, 75 (1985). See also Geever et a!., PNAS (USA), 78, 
5081 (1981) and Wilson et al., PNAS (USA), 79, 3628 (1982). ~ 
In a second method, called "Oligomer~festriction n , a synthetic end-labeled oligonucleotide probe is 
annealed in solution to the target genomic DNA sequence and a restriction enzyme is added to cleave any 
hybrids formed. This method, the specificity of which depends on the ability of a base pair mismatch within 

20 the restriction site to abolish or inhibit cleavage, is described more fully by Saiki et al., Biotechnology, 3, 
1008-1012 (1985). In addition, the sensitivity of this technique may be enhanced by utilizing a polymerase 
chain reaction procedure wherein the sample is first subjected to treatment with specific primers, poly- 
merase and nucleotides to amplify the signal for subsequent detection. This is described more fully by Saiki 
et al., Science . 230, 1 350-1 353 (1 985). 

25 A third method for detecting allelic variations which is independent of restriction site polymorphism 
utilizes sequence-specific synthetic oligonucleotide probes. See Conner et al., PNAS (USA), 80. 78 (1983). 
This latter technique has been applied to the prenatal diagnosis of a 1 -antitrypsin deficiency"" [Kidd et al., 
Nature , 304 , 230 (1983)) and jS-thalassemia (Pirastu et al., N. Engl. J. Med. , 309 . 284 (1983)). In addition, 
the technique has been applied to study the polymorphism of HLA-DR/3 using Southern blotting (Angelini et 

30 al., PNAS (USA), 83, 4489-4493 (1986)). 

The basis for this procedure is that under appropriate hybridization conditions a short oligonucleotide 
probe of at least 19 bases (19-mer) will anneal only to those sequences to which it is perfectly matched, a 
single base pair mismatch being sufficiently destabilizing to prevent hybridization. The distinction between 
the allelic variants is based on the thermal stability of the duplex formed between the genomic DNA and the 

35 oligonucleotide (19-mer) probe. 

In addition, methods for detecting base pair mismatches in double-stranded RNA and RNA:DNA 
heteroduplexes have been described using pancreatic ribonuclease (RNase A) to cleave the 
heteroduplexes. Winter et al., PNAS (USA), 82:7575-7579 (1985) and Myers et al.. Science , 230:1242-1246 
(1985). The principal deficiency of this method is its inability to recognize all types "oT base pair 

40 mismatches. 

Both the RFLP and oligonucleotide probe stability methods are relatively complex procedures, requiring 
restriction enzyme digestion, gel-fractionation of the genomic DNA, denaturation of the DNA, immobilization 
of the DNA either by transfer to a filter membrane or dessication of the gel itself, and hybridization of a 
labeled probe to the electrophoretically resolved array of immobilized genomic restriction fragments. These 

45 steps are necessary, for the oligonucleotide probe stability method, due to the complexity of human 
genomic DNA. Restriction and electrophoresis are necessary to separate the target sequence ("signal") 
from the rest of the genome ("noise"), and hybridization in the gel (instead of filter transfer) is necessary to 
retain as much target sequence as possible. Even then, detection of a signal in a 10 ug sample using a 
high specific activity kinased probe requires an autoradiographic exposure of four days. 

so In addition, the approach of Conner et al. requires at least a 19-mer probe for reasons of specificity (a 
shorter probe would hybridize to more genomic fragments), as well as possibly sensitivity. Shorter probes 
(e.g., 16-mers). however, would show more sequence-specific discrimination because a single mismatch 
would be more destabilizing. 

In copending EP-A-0237362, from which this application is divided, there is described and claimed a 

55 process for detecting the presence of a specific nucleotide sequence in nucleic acid in a sample, which 
process comprises: 

(a) treating the sample, together or sequentially, with four different nucleoside triphosphates, an agent for 
polymerization of the nucleoside triphosphates, and two oligonucleotide primers for said nucleic acid 
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under hybridizing conditions such that a primer will hybridize to said nucleic acid and an extension 
product of the primer be synthesized which is complementary to said nucleic acid, wherein said primers 
are selected such that the extension product synthesized from one primer, when separated from its 
complements, can serve as a template for synthesis of the extension product of the other primer; 
5 (b) treating the sample under denaturing conditions to separate the primer extension products from their 
templates; 

(c) treating the sample, together or sequentially, with said four nucleoside triphosphates, an agent for 
polymerization of the nucleoside triphosphates, and oligonucleotide primers such that a primer extension 
product is synthesized using each of the single strands produced in step (b) as a template, wherein 

to steps (b) and (c) are repeated a sufficient number of times exponentially to increase the amount of said 
nucleic acid and to result in detectable amplification thereof, 

(d) directly transferring, without gel fractionation, product derived from step (c) to a membrane; 

(e) treating the membrane from (d) under hybridization conditions with a labeled sequence-specific 
oligonucleotide probe capable of hybridizing with the amplified nucleic acid only if a sequence of the 

75 probe is complementary to a region of the amplified nucleic acid; and 

(f) detecting whether the probe has hybridized to an amplified nucleic acid in the sample. 

This method allows for detecting single or multiple nucleotide variations in nucleic acid sequence from 
any source, for use in detecting any type of disease or condition. The method herein directly detects the 
sequence variation, eliminating the need for restriction digestion, electrophoresis, and gel manipulations 

20 otherwise required. In addition, the method herein provides for improved specificity and sensitivity of the 
probe; an interpretable signal can be obtained with a 0.04 ug sample in six hours. Thirdly, if the amount of 
sample spotted on a membrane is increased to 0.1-0.5 jxg, non-isotopically labeled oligonucleotides may be 
utilized rather than the radioactive probes used in previous methods. Finally, the method is applicable to 
use of sequence-specific oligonucleotides less than 19-mers in size, thus allowing use of more discrimi- 

25 natory sequence-specific oligonucleotides. 

In a variation of the above method, the primer(s) and/or nucleotide triphosphates are labeled so that the 
resulting amplified sequence is labeled. The labeled primer(s) and/or nucleotide triphosphate(s) can be 
present in the reaction mixture initially or added during a later cycle. The sequence-specific oligonucleotide 
(unlabeled) is affixed to a membrane and treated under hybridization conditions with the labeled amplifica- 

30 tion product so that hybridization will occur only if the membrane-bound sequence is present in the 
amplification product. 

The invention now provides a group of probes which may be used in such a process, specifically a 
group of oligonucleotide probes in which the individual probes are capable of hybridizing to specific gene 
sequence variations so as to permit detection of said variations when said probes are labelled and so- 

35 hybridized, whereby allelic variations and gene polymorphism in samples may be detected using said group 
of probes. Put another way, the inventive concept includes a method of providing a group of probes for 
detecting allelic variations or gene polymorphism in samples, comprising selecting as a group a number of 
oligonucleotide probes in which the individual probes are capable of hybridizing to specific gene sequence 
variations so as to permit detection of said variations when said probes are labelled and so-hybridized. 

40 Regarding genetic diseases, while RFLP requires a polymorphic restriction site to be associated with 
the disease, sequence-specific oligonucleotides directly detect the genetic lesion and are generally more 
useful for the analysis of such genetic diseases as hemoglobin C disease, a1 -antitrypsin and ^-thalassemia 
which result from single or multiple base mutations. In addition, the oligonucleotides can be used to 
distinguish between, genetic variants which represent different alleles (e.g., HLA typing), indicating the 

45 feasibility of a sequence-specific oligonucleotide-based HLA typing kit 

The term "nucleotide variation in sequence" refers to any single or multiple nucleotide substitutions, 
deletions or insertions. These nucleotide variations may be mutant or polymorphic allele variations ,eg 
single nucleotide changes in nucleic acids such as occur in 0-giobin genetic diseases caused by single- 
base mutations, additions and deletions (some /^-thalassemias, sickle cell anemia, hemoglobin C disease, 

so etc.), as well as multiple-base variations such as are involved with a-thalassemia or some ^-thalassemias. 
Polymorphisms, which are not necessarily associated with a disease, can also be detected, but are merely 
a condition in which two or more different nucleotide sequences (whether having substituted, deleted or 
inserted nucleotide base pairs) can exist at a particular site in the nucleic acid in the population, as with 
HLA regions of the human genome and random polymorphisms such as. in mitochondrial DNA. The 

55 polymorphic sequence-specific oligonucleotide probes described in detail hereinafter may be used as 
genetic markers linked to a disease such as insulin-dependent diabetes or in forensic applications. If the 
nucleic acid is double-stranded, the nucleotide variation in sequence becomes a base pair variation in 
sequence. 
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The term "oligonucleotide " as used herein is defined as a molecule comprised of two or more 
deoxyribonucleotides or ribonucleotides, preferably more than three. Its exact size will depend on many 
factors, which in turn depend on the ultimate function or use of the oligonucleotide. The oligonucleotide may 
be derived synthetically or by cloning. 
5 The term "primer" as used herein refers to an oligonucleotide, whether occurring naturally as in a 
purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of 
synthesis when placed under conditions in which synthesis of a primer extension product which is 
complementary to a nucleic acid strand is induced, i.e., in the presence of four different nucleotide 
triphosphates and an agent for polymerization such as DNA polymerase in an appropriate buffer ("buffer" 

io includes pH, ionic strength, cofactors. etc.) and at a suitable temperature. 

The primer is preferably single stranded for maximum efficiency in amplification, but may alternatively 
be double stranded. If double stranded, the primer is first treated to separate its strands before being used 
to prepare extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be 
sufficiently long to prime the synthesis of extension products in the presence of the agent for polymeriza- 

rs tion. The exact lengths of the primers will depend on many factors, including temperature and source of 
primer and use of the method. For example, depending on the complexity of the target sequence, the 
oligonucleotide primer typically contains 15-25 nucleotides, although it may contain more or fewer 
nucleotides. Short primer molecules generally require lower temperatures to form sufficiently stable hybrid 
complexes with the template. 

20 The primers herein are selected to be "substantially" complementary to the different strands of each 
specific sequence to be amplified. This means that the primers must be sufficiently complementary to 
hybridize with their respective strands. Therefore, the primer sequence need not reflect the exact sequence 
of the template. For example, a non-complementary nucleotide fragment may be attached to the 5* end of 
the primer, with the remainder of the primer sequence being complementary to the strand. Typically, the 

25 primers have exact complementarity to obtain the best detection results. 

The term "sequence-specific oligonucleotides" refers to oligonucleotides which will hybridize to specific 
sequences, whether or not contained on alleles which sequences span the nucleotide variation being 
detected and are specific for the sequence variation being detected. In the present invention, more than one 
sequence-specific oligonucleotide is employed for each sequence, as described further hereinbelow. 

30 As used herein, the term "thermostable enzyme" refers to an enzyme which is stable to heat and is 
heat resistant and catalyzes (facilitates) combination of the nucleotides in the proper manner to form the 
primer extension products which are complementary to each nucleic acid strand. Generally, the synthesis 
will be initiated at the 3* end of each primer and will proceed in the 5' direction along the template strand, 
until synthesis terminates, producing molecules of different lengths. There may be thermostable enzymes, 

35 however, which initiate synthesis at the 5* end and proceed in the other direction, using the same process 
as described above. A purified thermostable enzyme is described more fully in Example VIII hereinbelow. 

The probe groups of the invention may used in a process involving amplifying any one or more specific 
nucleic acid sequences (as defined herein to contain one or more nucleotide variations) suspected of being 
in one or more nucleic acids. 

40 In general, this process involves a chain reaction for producing, in exponential quantities relative to the 
number of reaction steps involved, at least one specific nucleic acid sequence given (a) that the ends of the 
required sequence are known in sufficient detail that oligonucleotides can be synthesized which will 
hybridize to them, and (b) that a small amount of the sequence is available to initiate the chain reaction. The 
product of the chain reaction will be a discrete nucleic acid duplex with termini corresponding to the ends of 

45 the specific primers employed. 

Any nucleic acid, in purified or nonpurified form, can be utilized as the starting nucleic acid or acids, 
provided it is suspected of containing the sequence being detected. Thus, the process may employ, for 
example. DNA or RNA, including messenger RNA. which DNA or RNA may be single stranded or double 
stranded. In addition, a DNA-RNA hybrid which contains one strand of each may be utilized. A mixture of 

so any of these nucleic acids may also be employed, or the nucleic acids produced from a previous 
amplification reaction herein using the same or different primers may be so utilized. The specific nucleic 
acid sequence to be amplified may be only a fraction of a larger molecule or can be present initially as a 
discrete molecule, so that the specific sequence constitutes the entire nucleic acid. 

It is not necessary that the sequence to be amplified be present initially in a pure form; it may be a 

55 minor fraction of a complex mixture, such as a portion of the 0-globin gene contained in whole human DNA. 
or a portion of nucleic acid sequence due to a particular microorganism which organism might constitute 
only a very minor fraction of a particular biological sample. The starting nucleic acid may contain more than 
one desired specific nucleic acid sequence which may be the same or different Therefore, the process is 
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useful not only for producing large amounts of one specific nucleic acid sequence, but also for amplifying 
simultaneously more than one different specific nucleic acid sequence located on the same or different 
nucleic acid molecules when more than one of the base pair variations in sequence is present 

The nucleic acid or acids may be obtained from any source, for example, from plasmids such as 
5 pBR322, from cloned DNA or RNA, or from natural DNA or RNA from any source, including bacteria, yeast, 
viruses, organelles, and higher organisms such as plants or animals. DNA or RNA may be extracted from 
blood, tissue material such as chorionic villi or amniotic cells by a variety of techniques such as that 
described by Maniatis et al., Molecular Cloning (1982), 280-281. 

The cells may be directly used without purification of the nucleic acid if they are suspended in 

10 hypotonic buffer and heated to about 90-1 00 °C, until cell lysis and dispersion of intracellular components 
occur, generally about 1 to 15 minutes. After the heating step the amplification reagents may be added 
directly to the lysed cells. This direct cell detection method may be used on peripheral blood lymphocytes 
and amniocytes. 

Any specific nucleic acid sequence can be amplified . 

75 It is only necessary that a sufficient number of bases at both ends of the sequence be known in 
sufficient detail so that two oligonucleotide primers can be prepared which will hybridize to different strands 
of the desired sequence and at relative positions along the sequence such that an extension product 
synthesized from one primer, when it is separated from its template (complement), can serve as a template 
for extension of the other primer into a nucleic acid of defined length. The greater the knowledge about the 

20 bases at both ends of the sequence, the greater can be the specificity of the primers for the target nucleic 
acid sequence, and thus the greater the efficiency of the process. 

It will be understood that the word "primer" as used hereinafter may refer to more than one primer, 
particularly in the case where there is some ambiguity in the information regarding the terminal sequence(s) 
of the fragment to be amplified. For instance, in the case where a nucleic acid sequence is inferred from 

25 protein sequence information, a collection of primers containing sequences representing all possible codon 
variations based on degeneracy of the genetic code will be used for each strand. One primer from this 
collection will be homologous with the end of the desired sequence to be amplified. 

The oligonucleotide primers may be prepared using any suitable method, such as, for example, the 
organic synthesis of a nucleic acid from nucleoside derivatives. This synthesis may be performed in 

30 solution or on a solid support. One type of organic synthesis is the phosphotriester method, which has been 
utilized to prepare gene fragments or short genes. In the phosphotriester method, oligonucleotides are 
prepared that can then be joined together to form longer nucleic acids. For a description of this method, see 
Narang, S. A., et al., Meth. EnzymoK , 68, 90 (1979) and U.S. Patent No. 4,356,270. The patent describes the 
synthesis and cloning of the somatostatin gene. 

35 A second type of organic synthesis is the phosphodiester method, which has been utilized to prepare a 
tRNA gene. See Brown, E. L., et al., Meth. EnzymoL , 68, 109 (1979) for a description of this method. As in 
the phosphotriester method, this phosphodiester method involves synthesis of oligonucleotides that are 
subsequently joined together to form the desired nucleic acid. 

Automated embodiments of these methods may also be employed. In one such automated embodi- 

40 ment, diethylphosphoramidites are used as starting materials and may be synthesized as described by 
Beaucage et al., Tetrahedron Letters (1981), 22:1859-1862. One method for synthesizing oligonucleotides 
on a modified solid support is described in ILS. Patent No. 4,458,066. It is also possible to use a primer 
which has been isolated from a biological source (such as a restriction endonuclease digest). 

The specific nucleic acid sequence is produced by using the nucleic acid containing that sequence as a 

45 template. If the nucleic acid contains two strands, it is necessary to separate the strands of the nucleic acid 
before it can be used as the template, either as a separate step or simultaneously with the synthesis of the 
primer extension products. This strand separation can be accomplished by any suitable denaturing method 
including physical, chemical or enzymatic means. One physical method of separating the strands of the 
nucleic acid involves heating the nucleic acid until it is completely (>99%) denatured. Typical heat 

so denaturation may involve temperatures ranging from about 80 to 105* C for times ranging from about 1 to 
10 minutes. Strand separation may also be induced by an enzyme from the class of enzymes known as 
helicases or the enzyme RecA, which has helicase activity and in the presence of riboATP is known to 
denature DNA. The reaction conditions suitable for separating the strands of nucleic acids with helicases are 
described by Kuhn Hoffmann-Berling, CSH-Quantitative Biology , 43:63 (1978), and techniques for using 

55 RecA are reviewed in C. Radding, Ann. Rev. Genetics, 16:405-37 (1982). 

If the original nucleic acid containing the sequence variation to be amplified is single stranded, its 
complement is synthesized by adding one or two oligonucleotide primers thereto. If an appropriate single 
primer is added, a primer extension product is synthesized in the presence of the primer, an agent for 
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polymerization, and the four nucleotide triphosphates described below. The product will be partially 
complementary to the single-stranded nucleic acid and will hybridize with the nucleic acid strand to form a 
duplex of unequal length strands that may then be separated into single strands as described above to 
produce two single separated complementary strands. Alternatively, two appropriate primers may be added 

5 to the single-stranded nucleic acid and the reaction carried out. 

If the original nucleic acid constitutes the entire sequence variation to be amplified, the primer extension 
product(s) produced will be completely complementary to the strands of the original nucleic acid and will 
hybridize therewith to form a duplex of equal length strands to be separated into single-stranded molecules. 
When the complementary strands of the nucleic acid or acids are separated, whether the nucleic acid 

10 was originally double or single stranded, the strands are ready to be used as a template for the synthesis of 
additional nucleic acid strands. This synthesis can be performed using any suitable method. Generally it 
occurs in a buffered aqueous solution, preferably at a pH of 7-9, most preferably about 8. Preferably, a 
molar excess (for cloned nucleic acid, usually about 1000:1 primerrtemplate, and for genomic nucleic acid, 
usually about 10 G :1 primertemplate) of the two oligonucleotide primers is added to the buffer containing the 

75 separated template strands. It is understood, however, that the amount of complementary strand may not be 
known if the process is used for diagnostic applications, so that the amount of primer relative to the amount 
of complementary strand cannot be determined with certainty. As a practical matter, however, the amount of 
primer added will generally be in molar excess over the amount of complementary strand (template) when 
the sequence to be amplified is contained in a mixture of complicated long-chain nucleic acid strands. A 

20 large molar excess is preferred to improve the efficiency of the process. 

The deoxyribonucleoside triphosphates dATP. dCTP, dGTP and TTP are also added to the synthesis 
mixture in adequate amounts and the resulting solution is heated to about 90-100* C for from about 1 to 10 
minutes, preferably from 1 to 4 minutes. After this heating period the solution is allowed to cool to room 
temperature, which is preferable for the primer hybridization. To the cooled mixture is added an agent for 

25 polymerization, and the reaction is allowed to occur under conditions known in the art. This synthesis 
reaction may occur at from room temperature up to a temperature above which the inducing agent no 
longer functions efficiently. Thus, for example, if an E coli DNA polymerase is used as agent for 
polymerization, the temperature is generally no greater than about 40 °C. Most conveniently the reaction 
occurs at room temperature. 

so The agent for polymerization of the nucleotide triphosphates may be any compound or system which 
will function to accomplish the synthesis of primer extension products, including enzymes. Suitable 
enzymes for this purpose include, for example, E. coli DNA Polymerase I, Klenow fragment of E. coli DNA 
polymerase I, T4 DNA polymerase, other availabl^~DNA polymerases, reverse transcriptase7"and" other 
enzymes, including heat-stable enzymes, which will facilitate combination of the nucleotides in the proper 

35 manner to form the primer extension products which are complementary to each nucleic acid strand. 
Generally, the synthesis will be initiated at the 3' end of each primer and proceed in the 5* direction along 
the template strand, until synthesis terminates, producing molecules of different lengths. There may be 
agents, however, which initiate synthesis at the 5 1 end and proceed in the other direction, using the same 
process as described above. 

40 The newly synthesized strand and its complementary nucleic acid strand form a double-stranded 
molecule which is used in the succeeding steps of the process. In the next step, the strands of the double- 
stranded molecule are separated using any of the procedures described above to provide single-stranded 
molecules. 

New nucleic acid is synthesized on the single-stranded molecules. Additional agent for polymerization, 
45 nucleotides and primers may be added if necessary for the reaction to proceed under the conditions 
prescribed above. Again, the synthesis will be initiated at one end of the oligonucleotide primers and will 
proceed along the single strands of the template to produce additional nucleic acid. After this step, half of 
the extension product will consist of the specific nucleic acid sequence bounded by the two primers. 

The steps of strand separation and extension product synthesis can be repeated as often as needed to 
so produce the desired quantity of the specific nucleic acid sequence. As will be described in further detail 
below, the amount of the specific nucleic acid sequence produced will accumulate in an exponential 
fashion. 

When it is desired to produce more than one specific nucleic acid sequence from the first nucleic acid 
or mixture of nucleic acids, the appropriate number of different oligonucleotide primers are utilized. For 
55 example, if two different specific nucleic acid sequences are to be produced, four primers are utilized. Two 
of the primers are specific for one of the specific nucleic acid sequences and the other two primers are 
specific for the second specific nucleic acid sequence. In this manner, each of the two different specific 
sequences can be produced exponentially by the present process. 
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The amplification process can be performed in a step- wise fashion where after each step new reagents 
are added, or simultaneously, where all reagents are added at the initial step, or partially step-wise and 
partially simultaneous, where fresh reagent is added after a given number of steps. If a method of strand 
separation, such as heat, is employed which will inactivate the inducing agent, as in the case of a heat-labile 

5 enzyme, then it is necessary to replenish the agent for polymerization after every strand separation step. 
The simultaneous method may be utilized when an enzymatic means is used for the strand separation step. 
In the simultaneous procedure, the reaction mixture may contain, in addition to the nucleic acid strand (s) 
containing the desired sequence, the strand-separating enzyme (e.g., helicase), an appropriate energy 
source for the strand-separating enzyme, such as rATP, the four nucleotides, the oligonucleotide primers in 

10 molar excess, and the inducing agent, e.g., Klenow fragment of E. coli DNA Polymerase I. 

If heat is used for denaturation in a simultaneous process, aTieaFitable enzyme such as a thermostable 
polymerase may be employed which will operate at an elevated temperature, preferably 65-90° C depend- 
ing on the agent for polymerization, at which temperature the nucleic acid will consist of single and double 
strands in equilibrium. For smaller lengths of nucleic acid, lower temperatures of about 50* C may be 

75 employed. The upper temperature will depend on the temperature at which the enzyme will degrade or the 
temperature above which an insufficient level of primer hybridization will occur. Such a heat-stable enzyme 
is described, e.g., by A. S. Kaledin et al., Biokhimiya, 45, 644-651 (1980). Each step of the process will 
occur sequentially notwithstanding the initial presence "of all the reagents. Additional materials may be 
added as necessary. After the appropriate length of time has passed to produce the desired amount of the 

20 specific nucleic acid sequence, the reaction may be halted by inactivating the enzymes in any known 
manner or separating the components of the reaction. 

In an alternative method using a thermostable enzyme, the primers, enzyme and nucleotide 
triphosphates are contacted with the nucleic acid sample. Thereafter, the mixture is treated to denature the 
nucleic acids and then incubated at a temperature at which the primers can hybridize to complementary 

25 sequences In the nucleic acid sample. The mixture is then heated for an effective time and at an effective 
temperature to promote the activity of the enzyme, and to synthesize, for each different sequence being 
amplified, an extension product of each primer which is complementary to each nucleic acid strand 
template, but not so high as to separate each extension product from its complementary strand template. 
Next, the mixture is heated for an effective time and at an effective temperature to separate the primer 

30 extension products from the templates on which they were synthesized to produce single-stranded 
molecules, but not so high as to denature the enzyme irreversibly. The mixture is then cooled for an 
effective time and to an effective temperature to promote hybridization of each primer to each of the single- 
stranded molecules produced in the preceding step. Finally, the mixture is heated for an effective time and 
to an effective temperature to promote the activity of the enzyme and to synthesize, for each different 

35 sequence being amplified, an extension product of each primer which is complementary to and hybridized 
to a new nucleic acid strand template produced as a primer extension product, but not so high as to 
separate each extension product from its complementary strand template, where the last two steps may be 
carried out simultaneously or sequentially. 

A preferred thermostable enzyme which may be employed in the process herein is extracted and 

40 purified from Thermus aquaticus and has a molecular weight of about 86.000-90,000 daltons. This enzyme 
is more fully described in Example VIII hereinbelow. 

The process may be conducted continuously. In one preferred embodiment of an automated process 
wherein a thermostable enzyme is employed, the reaction may be cycled through a denaturing region, a 
primer annealing region, and a reaction region. In another embodiment, the enzyme used for the synthesis 

45 of primer extension products can be immobilized in a column. The other reaction components can be 
continuously circulated by a pump through the column and a heating coll in series, thus the nucleic acids 
produced can be repeatedly denatured without inactivating the enzyme. 

In one preferred embodiment even where a thermostable enzyme is not employed and the temperature 
is raised and lowered, one such instrument is an automated machine for handling the amplification reaction 

so of this invention. Briefly, this instrument utilizes a liquid handling system under computer control to make 
liquid transfers of enzyme stored at a controlled temperature in a first receptacle into a second receptacle 
whose temperature is controlled by the computer to conform to a certain incubation profile. The second 
receptacle stores the nucleic acid sequence(s) to be amplified plus the nucleotide triphosphates and 
primers. The computer includes a user interface through which a user can enter process parameters which 

55 control the characteristics of the various steps in the amplification sequence such as the times and 
temperatures of incubation, the amount of enzyme to transfer, etc. 

A preferred machine which may be employed which is specifically adapted for use with a thermostable 
enzyme utilizes temperature cycling without a liquid handling system because the enzyme need not be 
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transferred at every cycle. Briefly, this instrument consists of the following systems: 

1. A heat-conducting container for holding a given number of tubes, preferably 500 ul tubes, which 
contain the reaction mixture of nucleotide triphosphates, primers, nucleic acid sequences, and enzyme. 

2. A means to heat, cool, and maintain the heat-conducting container above and below room tempera- 
5 ture, which means has an input for receiving a control signal for controlling which of the temperatures at 

or to which the container is heated, cooled or maintained. (This may be Peltier heat pumps available 
from Materials Electronics Products Corporation in Trenton, N.J. or a water heat exchanger.) 

3. A computer means (e.g., a microprocessor controller), coupled to the input of said means, to generate 
the signals which control automatically the amplification sequence, the temperature levels, and the 

w temperature ramping and timing. 

The amplification process is demonstrated diagrammatically below where double-stranded DNA contain- 
ing the desired sequence [S] comprised of complementary strands [s + ] and [S~] is utilized as the nucleic 
acid. During the first and each subsequent reaction cycle extension of each oligonucleotide primer on the 
original template will produce one new ssDNA molecule product of indefinite length which terminates with 
is only one of the primers. These products, hereafter referred to as "long products," will accumulate in a linear 
fashion; that is, the amount present after any number of cycles will be proportional to the number of cycles. 

The long products thus produced will act as templates for one or the other of the oligonucleotide 
primers during subsequent cycles and will produce molecules of the desired sequence [S*] or [S~] These 
molecules^ will also function as templates for one or the other of the oligonucleotide primers, producing 
20 further [S ] and [S~], and thus a chain reaction can be sustained which will result in the accumulation of [S] 
at an exponential rate relative to the number of cycles. 

By-products formed by oligonucleotide hybridizations other than those intended are not self-catalytic 
(except in rare instances) and thus accumulate at a linear rate. 

The specific sequence to be amplified, [S], can be depicted diagrammatically as: 



CS^D 5 1 AAAAAAAAAAXXXXXXXXXXCCCCCCCCCC 3 1 

CS~ J 3 1 TTTTTTTTTTYYY YYYY Y YYGGGGGGGGGG 5 1 

30 

The appropriate oligonucleotide primers would be: 



35 



40 



Primer 1: GGGGGGGGGG 
Primer 2: AAAAAAAAAA 



so that if DNA containing [S] 



zzzzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz 

....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGGzzzzzzzzzzzzzzzz'.W* 

45 is separated into single strands and its single strands are hybridized to Primers 1 and 2, the following 
extension reactions can be catalyzed by DNA polymerase in the presence of the four deoxyribonucleoside 
triphosphates: 



50 



55 
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3' 5' 

extends^— ■ 6G66GGGGG6 Primer 1 

. . . .zzzzzzzzzzzzzzz2AAAAMAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzz2Z22222zz 
original template strand 

original tenplate strand" 

zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGGzzz2ZZZZ2Z2ZZ2ZZ 

Primer 2 AAAAAAAAAA >extends 

5' 3' 



75 On denaturation of the two duplexes formed, the products are: 

zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 

newly synthesized long product 1 



20 



25 



5' 3 . 

zzzzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzz22ZZZZZ 

original tenplate strand* 

30 3' 5* 

.•..zzzzzzzzzzzzzzzzTTTTTTTnTYYYYYYYYYYGGGGGGGGGGzzzzzzzzzzzzzzzz 

original template strand" 



35 



40 



5' 3' 
AAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz..^ 
newly synthesized long product 2 

If these four strands are allowed to rehybridize with Primers 1 and 2 In the next cycle, the agent for 
polymerization will catalyze the following reactions: 

45 Primer 2 5' AAAAAAAAAA — ^extends to here 

3* zzzzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5' 

newly synthesized long product 1 



50 



extends f _ GGGGGGGGGG 5' Primer 1 



55 
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5 l zzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzz 3' 

original template strand"*" 

5 Primer 2 5' AAAAAAAAAA ^ extends 

3' zzzzzzzzzzzzzzzzzzTTTTTTTTTTmYYYyYY6GSGGGG6GGzzzzzzzzzz....5 , 

original terrplate strand" 

w extends to here{ GGGGGGGGGG 5* Primer 1 

5' AAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz. .3' 
newly synthesized long product 2 

is If the strands of the above four duplexes are separated, the following strands are found: 

5 1 AAAAAAAAAAXXXXXXXXXXCCCCCCCCCC 3* 
newly synthesized [S ] 



20 



3' zzzzzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5' 

first cycle synthesized long product 1 

3' zzzzzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5* 

25 newly synthesized long product 1 

5 1 zzzzzzzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzz 3' 

original template strand 

30 5 1 AAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz . . .3 1 

newly synthesized long product 2 

3* ..zzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGGzzzzzzzzzzzzzzzz^.B 4 
original template strand" 

35 

3' TTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5' 
newly synthesized [S"] 

5 1 AAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzz. . .3 1 
^0 first cycle synthesized long product 2 

It is seen that each strand which terminates with the oligonucleotide sequence of one primer and the 
complementary sequence of the other is the specific nucleic acid sequence [S] that is desired to be 
45 produced. 

The steps of this process can be repeated indefinitely, being limited only by the amount of Primers 1 
and 2, agent for polymerization and nucleotides present The amount of original nucleic acid remains 
constant in the entire process, because it is not replicated. The amount of the long products increases 
linearly because they are produced only from the original nucleic acid. The amount of the specific sequence 
so increases exponentially. Thus, the specific sequence will become the predominant species. This is 
illustrated in the following table, which indicates the relative amounts of the species theoretically present 
after n cycles, assuming 100% efficiency at each cycle: 



55 
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Number of Double Strands 
After 0 to n Cycles 

Long Specific 
Cycle Number Template Products Sequence [S] 



0 
1 
4 
26 
1013 
32,752 
1,048,555 
(2 n -n-l) 





0 


1 






1 


1 


1 


10 


2 


1 


2 




3 


1 


3 




5 


1 


5 




10 


1 


10 




15 


1 


15 


IS 


20 


1 


20 




n 


1 


n 



When a single-stranded nucleic acid is utilized as the template, only one long product is formed per cycle. 

20 The desired amount of cycles of this reaction will depend on, e.g., the nature of the sample. Fewer 
cycles will be required if the sample being analyzed is pure. If the sample is a complex mixture of nucleic 
acids, more cycles will be required to amplify the signal sufficiently for it to be detected by the method 
herein. For human genomic DNA preferably 15-30 cycles are carried out to amplify the sequence 
sufficiently that a clearly detectable signal is produced (i.e., so that background noise does not interfere with 

25 detection). 

In one embodiment of the invention herein, the amplified sample suspected of containing the sequence 
variation, whether resulting from cancer, an infectious disease, a genetic disease, or just normal genetic 
polymorphism, is spotted directly on a series of membranes and each membrane is hybridized with a 
different labeled sequence- specific oligonucleotide probe. One procedure for spotting the sample on a 

30 membrane is described by Kafotos et al., Nucleic Acids Research, 7:1541-1552 (1979). 

Briefly, the DNA sample affixed to the membrane may be pretreated with a prehybridization solution 
containing sodium dodecyl sulfate, Ficoll, serum albumin and various salts prior to the probe being added. 
Then, a labeled oligonucleotide probe which is specific to each sequence to be detected is added to a 
hybridization solution similar to the prehybridization solution. The hybridization solution is applied to the 

35 membrane and the membrane is subjected to hybridization conditions that will depend on the probe type 
and length, type and concentration of ingredients, etc. Generally, hybridization is carried out at about 25- 
75° C, preferably 35 to 65* C, for 0.25-50 hours, preferably less than three hours. The greater the stringency 
of conditions, the greater the required complementarity for hybridization between the probe and sample. If 
the background level is high, stringency may be increased accordingly. The stringency can also be 

40 incorporated in the wash. 

After the hybridization the sample is washed of unhybridized probe using any suitable means such as 
by washing one or more times with varying concentrations of standard saline phosphate EDTA (SSPE) (180 
mM NaCI, 10 mM NaCI, 10 mM NaHPO* and 1 H EDTA, pH 7.4) solutions at 25-75* C for about 10 minutes 
to one hour, depending on the temperature. The label is then detected by using any appropriate detection 

45 techniques. 

The sequence-specific oligonucleotides of the probe groups of this invention are oligonucleotides which 
are generally prepared and selected as described above for preparing and selecting the primers The 
sequence-specific oligonucleotides must encompass the region of the sequence which spans the nucleotide 
variations being detected and must but specific for the nucleotide variations being detected. For example, if 

so it is desired to detect whether a sample contains the mutation for sickle cell anaemia one oligonucleotide 
will be prepared which contains the nucleotide sequence site characteristic of the normal 0-globin gene, and 
one oligonucleotide will be prepared which contains the nucleotide sequence characteristic of the sickle cell 
allele. Each oligonucleotide would be hybridized to duplicates of the same sample to determine whether the 
sample contains the mutation. 

55 The polymorphic areas of HLA class II genes are localized to specific regions of the second exon and 
are flanked by conserved sequences, so that oligonucleotide primers complementary to opposite strands of 
the conserved 5' and 3' ends of the second exon can be prepared. 

The number of oligonucleotides employed for detection of the polymorphic areas of the HLA class II 
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genes will vary depending on the type of gene, which has regions of base pair variation which may be 
clustered or spread apart. If the regions are clustered, as in the case with HLA-DQa, then one 
oligonucleotide is employed for each allele. If the regions are spread apart, as in the case with HLA-DQ0 
and HLA-DRj8. then more than one probe, each encompassing an allelic variant, will be used for each allele. 
5 In the case of HLA-DQjS and HLA-DR0, three probes are employed for the three regions of the locus where 
allelic variations may occur. For detection of sequences associated with insulin-dependent diabetes meliitus 
(IDDM) four probes for the HLA-DRjS second exon are employed. 

Hapiotypes can be inferred from segregation analysis in families or, in some cases, by direct analysis of 
the individual DNA sample. Specific allelic combinations (hapiotypes) of sequence-specific oligonucleotide 

10 reactivities can be identified in heterozygous cells by using restriction enzyme digestion of the genomic 
DNA prior to amplification. 

For example, if in DQ/3 one finds three highly variable subregions A. B, and C within a single amplified 
region, and if there are six different sequences at each region (A1-6, B1-6, C1-6), then an individual could 
be typed in the DQjS locus by sequence-specific oligonucleotide probe analysis as containing Al. A2; B2, 

15 B3; C1 t C4, with the possible haplotype combinations of A1, B2 t CI; A1, B2, C4; A2, B2, CI; A2, B2, C4; 
Al, B3, C1; A1, B3. C4; A1, B2, C1; and A1, B2, C4. 

If the genomic DNA is digested with a polymorphic restriction enzyme prior to amplification, and if the 
enzyme cuts both alleles between the primers, there is no reactivity with the sequence-specific probes due 
to lack of amplification, and the result is uninformative. If the enzyme cuts neither allele, the probe results 

20 with digested and undigested genomic DNA are the same and the result is uninformative. If the enzyme 
cuts only one allele, however, then one can infer both hapiotypes by comparing the probe reactivity patterns 
on digested and undigested DNA. 

The hapiotypes can be deduced by comparing sequence-specific oligonucleotide reactivities with uncut 
genomic DNA and genomic DNA cut with one or several enzymes known to be polymorphic and to 

25 recognize sites between the primers. 

The length of each sequence-specific oligonucleotide will depend on many factors, including the 
particular target molecule being detected, the source of oligonucleotide, and the nucleotide composition. For 
purposes herein, each probe typically contains 15-25 nucleotides, although it may contain more or fewer 
nucleotides. While oligonucleotides which are at least 19-mers in length may enhance specificity and/or 

30 sensitivity, probes which are less than 19-mers, e.g.. 16-mers, show more sequence-specific discrimination, 
presumably because a single mismatch is more destabilizing. Because amplification increases specificity so 
that a longer length is less critical, and hybridization and washing temperatures can be lowered for the 
same salt concentration, it is preferred to use probes which are less than 19-mers. 

Where the sample is first placed on the membrane and then detected with the oligonucleotides, each 

35 oligonucleotide must be labeled with a suitable label moiety, which may be detected by spectroscopic, 
photochemical, biochemical, immunochemical or chemical means. Immunochemical means include anti- 
bodies which are capable of forming a complex with the oligonucleotide under suitable conditions, and 
biochemical means include polypeptides or lectins capable of forming a complex with the oligonucleotide 
under the appropriate conditions. Examples include fluorescent dyes, electron-dense reagents, enzymes 

40 capable of depositing insoluble reaction products or being detected chronogenically, such as alkaline 
phosphatase, a radioactive label such as K P, or biotin. If biotin is employed, a spacer arm may be utilized 
to attach it to the oligonucleotide. 

Alternatively, in one "reverse" dot blot format, at least one of the primers and/or at least one of the four 
nucleotide triphosphates is labeled with a detectable label, so that the resulting amplified sequence is 

45 labeled. These labeled moieties may be present initially in the reaction mixture or added during a later 
cycle. Then unlabeled sequence-specific oligonucieotidescapable of hybridizing with the amplified nucleic 
acid sequence, if the variation(s) in sequence (whether normal or mutant) is/are present, are spotted on 
(affixed to) membrane under prehybridization conditions as described above. The amplified sample is then 
added to the pretreated membrane under hybridization conditions as described above. Finally, detection 

so means are used to determine if an amplified sequence in the nucleic acid sample has hybridized to 
oligonucleotide affixed to the membrane, Hybridization will occur only if the membrane-bound sequence 
containing the variation is present in the amplification product i.e., only if a sequence of a probe is 
complementary to a region of the amplified sequence. 

In another version of the "reverse" dot blot format, the amplification is carried out without employing a 

55 label as with the "forward" dot blot format described above, and labeled sequence-specific oligonucleotide 
probes capable of hybridizing with the amplified nucleic acid sequence containing the variation, if present, 
are spotted on (affixed to) membrane under prehybridization conditions as described above. The amplified 
sample is then added to 
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pretreated membrane under hybridization conditions as described above. Then labeled oligonucleotide 
or a fragment thereof is released from the membrane in such a way that a detection means can be used to 
determine if an amplified sequence in the sample hybridized to 

labeled oligonucleotide. The release may take place, for example, by adding a restriction enzyme to the 
5 membrane which recognizes a restriction site in the probe. This procedure, known as oligomer restriction, is 
described more fully in EP -A-1 64,054. 

For purposes of this invention, the genetic diseases which may be detected include specific deletions, 
insertions and/or substitutions in any base pair mutation or polymorphism in nucleic acids, for example, 
genomic DNA, from any organism. Examples of diseases in which a base pair variation is known include 
w sickle cell anemia, hemoglobin C disease, a-thalassemia, ^-thalassemia, and the like. Other diseases that 
may be detected include cancerous diseases such as those involving the RAS oncogenes, e.g., the n-RAS 
oncogene, and infectious diseases. 

The present probes may be used for HLA typing in the areas of tissue transplantation, disease 
susceptibility, and paternity determination. The HLA class II genes, consisting of the a and 0 genes from 
75 the HLA-DR, HLA-DQ and H LA-DP regions, are highly polymorphic; their genetic complexity at the DNA 
level is significantly greater than the polymorphism currently defined by serological typing. In addition, the 
process may be employed to detect certain DNA sequences coding for HLA class II 0 proteins (e.g., DH0) 
associated with insulin-dependent diabetes mellitus (IDDM). Briefly, the four DNA sequences associated 
with IDDM are selected from the group consisting of: 

20 

1) 5 1 -GAGCTGCGTAAGTCTGAG-3 1 f 

2) 5 1 -GAGGAGTTCCTGCGCTTC- 3 ' . 

3) 5 1 -CCTGTCGCCGAGTCCTGG-3' f and 
25 4)5' -GACATCCTGGAAGACGAGAGA-3' , 



or the DNA strands that are complementary thereto. Sequence-specific probes may be prepared that will 
hybridize to one or more of these sequences. 
30 The following examples illustrate various aspects of the invention and are not intended to be limiting in 
any respect. In the examples all parts and percentages are by weight if solid and by volume if liquid, and all 
temperatures are in degrees Celsius, unless otherwise indicated. 



EXAMPLE I 

35 

This example illustrates how the process herein can be used to distinguish normal alleles (A) from 
sickle cell alleles (S) from Hemoglobin C disease alleles (C). 

I. Synthesis of the Primers 

40 

The following two oligonucleotide primers were prepared by the method described below: 



5' -ACACAACTGTGTTCACTAGC-3 < (PC03) 
5 1 -CAACTTCATCCACGTTCACC- 3 1 ( PC04 ) 



These primers, both 20-mers, anneal to opposite strands of the genomic DNA with their 5' ends separated 
by a distance of 1 1 0 base pairs. 

so A. Automated Synthesis Procedures: The diethyiphosphoramidites, synthesized according to Beaucage 
and Caruthers (Tetrahedron Letters (1981) 22:1859-1862) were sequentially condensed to a nucleoside 
derivatized controlled pore glass support. The procedure included detritylation with trichloroacetic acid in 
dichloromethane, condensation using benzotriazole as activating proton donor, and capping with acetic 
anhydride and dimethylaminopyridine in tetrahydrofuran and pyridine. Cycle time was approximately 30 

55 minutes. Yields at each step were essentially quantitative and were determined by collection and 
spectroscopic examination of the dimethoxytrityl alcohol released during detritylation. 
B. Oligodeoxyribonucleotide Deprotection and Purification Procedures: The solid support was removed 
from the column and exposed to 1 ml concentrated ammonium hydroxide at room temperature for four 
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hours in a closed tube. The support was then removed by filtration and the solution containing the 
partially protected oligodeoxy nucleotide was brought to 55* C for five hours. Ammonia was removed and 
the residue was applied to a preparative poly aery lamide gel. Electrophoresis was carried out at 30 
volts/cm for 90 minutes after which the band containing the product was identified by UV shadowing of a 

s fluorescent plate. The band was excised and eluted with 1 ml distilled water overnight at 40* C. This 
solution was applied to a column and eluted with a 7-13% gradient of acetonitrile in 1% ammonium 
acetate buffer at pH 6.0. The elution was monitored by UV absorbance at 260 nm and the appropriate 
fraction collected, quantitated by UV absorbance in a fixed volume and evaporated to dryness at room 
temperature in a vacuum centrifuge. 

w C. Characterization of Oligodeoxyribonucleotides: Test aliquots of the purified oligonucleotides were 32 P 
labeled with polynucleotide kinase and 7 - 32 P-ATP. The labeled compounds were examined by auto- 
radiography of 14-20% polyacry lamide gels after electrophoresis for 45 minutes at 50 volts/cm. This 
procedure verifies the molecular weight. Base composition was determined by digestion of the 
oligodeoxyribonucleotide to nucleosides by use of venom diesterase and bacterial alkaline phosphatase 

is and subsequent separation and quantitation of the derived nucleosides using a reverse phase HPLC 
column and a 10% acetonitrile, 1% ammonium acetate mobile phase. 

II. Isolation of Human Genomic DNA from Cell Line 

20 High molecular weight genomic DNA was isolated from the lymphoid cell line GM2064 using essentially 
the method of Maniatis et al., Molecular Cloning (1982), 280-281. GM2064 (Human Mutant Cell Repository, 
Camden, N.J.) was originally isolated from an individual homozygous for hereditary persistance of fetal 
hemoglobin (HPFH) and contains no /S- or 5-globin gene sequences. This cell line was maintained in RPMI- 
1640 with 10% fetal calf serum. 



III. Isolation of Human Genomic DNA from Clinical Samples 

Five clinical blood samples designated AA (from a known normal individual), AS (from a known sickle 
cell carrier), SS (from a known sickle cell individual), SC (from a known sickle cell/hemoglobin C diseased 
30 individual), and AC (from a known hemoglobin C disease carrier) were obtained from Dr. Bertram Lubin of 
Children's Hospital in Oakland, California. One clinical DNA sample designated CC (from a known 
hemoglobin C diseased individual) was obtained from Dr. Stephen Embury of San Francisco General 
Hospital in San Francisco, California. 

Genomic DNA from the first five of these samples was prepared from the buffy coat fraction, which is 
35 composed primarily of peripheral blood lymphocytes, as described by Saiki et al., Biotechnology 3-1008- 
1012 (1985). 

IV. Amplification Reaction 

40 One microgram of DNA from each of the seven DNA samples (10 ul of 100 ug/ml DNA) was amplified 
in an initial 100 ul reaction volume containing 10 ul of a solution containing 100 mM Tris'HCI buffer (pH 
7.5), 500 mM NaCI, and 100 mM MgCI 2 . 10 ul of 10 uM of primer PC03, 10 ul of 10 uM of primer PC04, 
15 ul of 40 mM dNTP (contains 10 mM each of dATP, dCTP, dGTP and TTP), and 45 ul of water. 

Each reaction mixture was held in a heat block set at 95* C for 10 minutes to denature the DNA. Then 

45 each DNA sample underwent 25 cycles of amplification where each cycle was composed of four steps: 

(1) spin briefly (10-20 seconds) in microcentrifuge to pellet condensation and transfer the denatured 
material immediately to a heat block set at 30* C for two minutes to allow primers and genomic DNA to 
anneal, 

(2) add 2 ul of a solution prepared by mixing 39 ul of the Klenow fragment of E. coli DNA Polymerase I 
so (New England Biolabs. 5 units'ul), 39 ul of a salt mixture of 100 mM Tris buffeT (pFT7.5). 500 mM NaCI 

and 100 mM MgCI 2 , and 312 ul of water, 

(3) allowing the reaction to proceed for two minutes at 30* C, and 

(4) transferring the samples to the 95* C heat block for two minutes to denature the newly synthesized 
DNA, except this reaction was not carried out at the last cycle. 

55 The final reaction volume was 150 ul. and the reaction mixture was stored at -20* C. 

V. Synthesis and Phosphorylation of Oligodeoxyribonucleotide Probes 



25 
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Three labeled DNA probes, designated RS17, RS18 and RS21, of the following sequences were 
prepared as follows: 



5' CTCCTAAGGAGAAGTCTGC- 3 1 (RS17) 
5 1 C TCCTGAGGAGAAGTCTGC- 3 1 (RSI 8) 
5 ' CTCCTGTGGAGAAGTCTGC- 3 1 (RS21 ) 



70 where * indicates the label. These probes are 19 bases long and span the fifth through eleventh codons of 
the gene. RS18 is complementary to the normal £-globin allele (0 A ), RS21 to the sickle cell anemia allele 
(0 s ), and RS17 to the hemoglobin C disease allele (0°). RS17 and RS21 differ from RS18 by a single base. 
The schematic diagram of primers and probes is given below: 



75 



110 bp 



B-globin 



PC03 



20 



"RSIT 
RS18 
RS21 



~PC0T 



These three probes were synthesized according to the procedures described in Section I. The probes 
were labeled by contacting 10 pmole thereof with 4 units of T4 polynucleotide kinase (New England 

25 Biolabs) and about 40 pmole 7 - 32 P-ATP (New England Nuclear, about 7000 Ci/mmole) in a 40 ul reaction 
volume containing 70 mM Tris buffer (pH 7.6), 10 mM MgCI 2 . 1.5 mM spermine, 100 mM dithiothreitol and 
water for 60 minutes at 37° C. The total volume was then adjusted to 100 ul with 25 mM EDTA and purified 
according to the procedure of Maniatis et al., Molecular Cloning (1982), 466-467 over a 1 ml Bio Gel P-4 
(BioRad) spin dialysis column equilibrated with Tris- EDTA (TE) buffer (10 mM Tris buffer, 0.1 mM EDTA, 

30 pH 8.0). TCA precipitation of the reaction product indicated that for RS17 the specific activity was 5.2 
uCi/pmole and the final concentration was 0.118 pmole/ul. For RS18 the specific activity was 4.6 uCi/pmole 
and the final concentration was 0.114 pmole/ul. For RS21 the specific activity was 3.8 uCi/pmole and the 
final concentration was 0.112 pmole/ul. 

35 VI. Dot Blot Hybridizations 



Five microliters of each of the 150 ul amplified samples from Section III was diluted with 195 ul 0.4 N 
NaOH, 25 mM EDTA and spotted onto three replicate cationic nylon filters by first wetting the filter with 
water, placing it in an apparatus for preparing dot blots which holds the filter in place, applying the samples, 

40 and rinsing each well with 0.4 ml of 20 x SSPE (3.6 M NaCI, 200 mM NahfePO*. 20 mM EDTA), as 
disclosed by Reed and Mann, Nucleic Acids Research, 13, 7202-7221 (1985). The filters were then 
removed, rinsed in 20 x SSPE, and baked for 30 minutes at 80 C in a vacuum oven. 

After baking, each filter was then contacted with 6 ml of a hybridization solution consisting of 5 x SSPE, 
5 x Denhardt's solution (1 x = 0.02% polyvinylpyrrolidone, 0.02% Ficoll, 0.02% bovine serum albumin, 0.2 

45 mM Tris'HCI, 0.2 mM EDTA, pH 8.0) and 0.5% SDS and incubated for 60 minutes at 55* C. Then 5 ul 
each of probes RS17. RS18 and RS21 was added to the hybridization solution and the filter was incubated 
for 60 minutes at 55* C. 

Finally, each hybridized filter was washed twice with 100 ml 2 x SSPE and 0.1% SDS for 10 minutes at 
room temperature. As a third wash for RS17, 250 ml of 5 x SSPE and 0.1% SDS was added and the filter 

so heated for five minutes at 55* C. For RS18 and 21, the filters were treated with 250 ml of 4 x SSPE, 0.1% 
SDS for five minutes at 55 *C. There was a faint background with RS18 and RS21 because 4 x SSPE at 
55* C was not sufficiently stringent The washing with the RS18 and RS21 probes was repeated in 250 ml of 
5 x SSPE, 0.1% SDS for three minutes at 55 *C. There was no change in the background. The wash was 
repeated with 5 x SSPE at 60* C for one minute and an additional wash of the same stringency was done 

55 for three minutes. This resulted in virtually no background. The genotypes were readily apparent after 90 
minutes of autoradiography. 



VII. Discussion of Autoradiogram 
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The autoradiogram of the dot blot of the seven amplified genomic DNA samples hybridized with allele- 
specific jS-globin probes RS18, RS21 and RS17 was analyzed after 12 hours. The negative control GM2064 
was included. The results clearly indicate that each allele-specific probe annealed only to the DNA samples 
which had at least one copy of the 0-globin allele to which it was perfectly matched. For example, the 0 A - 
5 specific probe, RS18. hybridized only to samples AA (/S A ;S A ), AS (/9 A 0 S ), and AC (fi A 0 c ). 

EXAMPLE II 

To determine the minimum levels of detection by dot blot, eight serial dilutions containing 128. 64, 32, 
10 16, 8, 4, 2 and 1 ng of normal genomic DNA were made from sample AA and subjected to 25 cycles of 
amplification as described in Example !. 

As controls, the amplified samples of AA and SS from Example I were similarly diluted as well. 
A total of 75 u\ (one-half) of each sample was mixed with 125 ill of 0.65 N NaOH and 25 mM EDTA and 
the mixture was applied to a nylon filter. Then the filter was rinsed in 20 x SSPE and baked for 30 minutes 
is at80*C. 

The filter was then probed as described in Example I with RS18 (the 0 A probe) to determine the 
detection threshold. The prehybridization solution was 8 ml of 5 x SSPE. 5 x Denhardt's solution. 0.5% SDS 
for 40 minutes at 55* C and the hybridization solution was the same plus 10 ul of RS18 for 80 minutes at 
55* C. The filters were then washed with 2 x SSPE. 0.1% SDS for 10 minutes at room temperature twice 

20 and then with 5 x SSPE, 0.1 % SDS for three minutes at 60 ° C. 

The autoradiogram obtained after hybridization with RS18 after 17 hours of exposure revealed positive 
signals in all samples containing the AA DNA. The SS sample was visible after 17 hours but the intensity of 
the 64 ng SS was equivalent to the intensity of 1 ng AA, which is a signal-to-noise ratio of 64:1. The 
intensity of the signal present in the 0.5 ng spot suggested that amplification of samples containing 

25 significantly less than 1 ng is possible. (One nanogram is the amount of genomic DNA present in 150 
diploid ceils.) 

EXAMPLE III 

30 A. Amplification and Detection of HLA-DQa Sequences 
I. Preparation of Primers 

Oligonucleotides designated GH26 and GH27 complementary to opposite strands of the conserved 5* 
35 and 3' ends of the DQa second exon were used as primers to amplify a 240 base pair fragment The 
primers, having the following sequences, were prepared as described in Example I. 

5 1 -GTGCTGCAGGTGTAAACTTGTACCAG-3' (GH26) 
m 5' -CACGGATCCGGTAGCAGCGGTAGAGTTG-3 1 (GH27) 



II. Preparation of Probes 

45 Based on the analysis of HLA-DQa sequences from diverse sources, which were grouped into allelic 
variants, the following probes from variable regions of the DQa second exon encompassing each variant 
were synthesized and labeled as described in Example I. The two variable regions of the HLA-DQa second 
exon (called "exon T), segments A and B, are shown in Table I. The entire scheme of primers (designated 
by PCR— ), probes and HLA-DQa sequence is shown in Table II, and the amino acid abbreviations used 

so therein are shown in Table III. 
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TABLE I 

HLA DQo (segment A): 

5 



DRl,2,w6 
DR5,8 
DR4,7,9 
0R3 



HLA DQa (segment B): 

15 

45 50 55 60 

AlaTrpArgTrpProGluPbeSerLysPheGlyGlyPheAspProGl nGly 
GCCTGGCGGTGGCCTGAGTTC AGCAAATTTGGAGGTTTTGACCCGCAGGGT DR1 , 2 , 5 , w6 . 8 

-T A— T CT C G A— A-A ATT- DR4.9 

20 -T AA— T CT CA—G-C—A -A ATT- 0R7 

-T T— T-T TTC AC A -A ATT- OR 3 

DXA: -T A--T AT T AT- A — ■ A — 



25 



30 



35 



40 



45 



50 



35 40 
GlyAspGluGluPheTyrValAspLeuGluArglysGlu 
GGAGATGAGGAGTTCTACGTGGACCTGGAGAGGAAGGAG 
— — C A 

C G 

DXA: C T C A — 



17 



EP 0 459 533 A2 



OOOOOOOOOOQQO 



CO 

00 a* 



Of 

-J 



cd 
to 

U- 



LU 
O UJ 



1 1 1 1 1 1 

>■ >» -j .j -j 

1/1 CO 



111!!! 

W7777 

777777 

1 Till 

or or or or or or 
or or or 
i Ti -J i i 
or or or or o cr 
or or or 2f or oc 

• • t « — j _j 
—J — f — t > 

t i i i i i 
—» -j _j -j -j _j 
cr o 1 cr ^ o cj 

• i i i i i 



aoaaaa 



cd 
o 



CD 

CO , 

a. 
o CD 



CD 
CJ 
CO 

<c 



CD CD 



to CO to u. 



co to to to co to 



>->->->- 



uacjy<a<<u<io<:u 
oQa-acoLui-u^oQCJx 

O Q Q Q Q o 



18 




EP 0 459 533 A2 



TABLE III 



Amino Acid Abbreviation Codes 


Alanine 


Ala 


A 


Arginine 


Arg 


R 


Asparagine 


Asn 


N 


Aspartic Acid 


Asp 


D 


Cysteine 


Cys 


C 


Glutamine 


Gin 


Q 


Glutamic Acid 


Glu 


E 


Glycine 


Gly 


G 


Histidine 


His 


H 


Isoieucine 


lie 


1 


Leucine 


Leu 


L 


Lysine 


Lys 


K 


Methionine 


Met 


M 


Phenylalanine 


Phe 


F 


Proline 


Pro 


P 


Serine 


Ser 


S 


Threonine 


Thr 


T 


Tryptophan 


Trp 


W 


Tyrosine 


Tyr 


Y 


Valine 


Val 


V 



The probes are as follows, where * is the label. 



5 1 TGTTTGCCTGTTCTCAGAC- 3 1 (GH66) 

5 1 TTCCGC AGATTTAGAAGAT- 3 1 (GK67) 

5» TTCCACAGACTTAGATTTG-3' (GH68) 

5 1 CTC AGGCCACCGCCAGGC A- 3 1 (GH75) 



The GH75 probe was derived from the DRw6 sequence, which is described by Auffray et ai., Nature, 
308:327 (1984). The GH67 probe was derived from the DR4 sequence, which is described by Auffray et al., 
PNAS , 79:6337 (1982). The GH68 probe was derived from the DR7 sequence which is described by Chang 
et al., Nature , 305:813 (1983). The GH66 probe was derived from the DR3 sequence, which is described by 
Schenning et al., EMBO J., 3:447 (1984). A control oligonucleotide was derived from a conserved segment 
of all of these alleles. 

HI. Origin and Preparation of Genomic DNA 



4s Eleven DNA samples described below were prepared for subsequent amplification as described in 
Example I. 

LG2 cell line (genotype DR1) from Drs. John Bell and Dan Denny of Stanford University, Stanford, 
California 

PGF cell line (genotype DR2) from Drs. John Bell and Dan Denny 
5q AVL cell line (genotype DR3) from Drs. John Bell and Dan Denny 
^ DKB cell line (genotype DR4) from Drs. John Bell and Dan Denny 

JGL cell line (genotype DR5) from Dr. Gerry Nepom of Virginia Mason Hospital, Seattle, Washington 

APD cell line (genotype DR6) from Drs. John Bell and Dan Denny 

LBF cell line (genotype DR7) from Drs. John Bell and Dan Denny 
5s TAB cell line (genotype DR8) from Dr. Gerry Nepom 

KOZ cell line (genotype DR9) from Dr. Gerry Nepom 

Sample GM2741A (genotype DR1,3) available from the Human Genetic Mutant Cell Repository, 
Camden, N.J. 

Sample GM2676 (genotype DR4.3) available from the Human Genetic Mutant Cell Repository 
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IV. Amplification Reaction 

Each genomic DNA sample was amplified as described in Example I using GH26 and GH27 as primers 
for 28 cycles, except that the polymerization step 3 was carried out at 37° C in the presence of 10% by 
5 weight dimethylsu If oxide (DMSO), and except that the amplification reaction took place in the aluminum 
beating block of the above-described automated liquid handling temperature-cycling instrument using the 
following program: 

1) 2.5 min., 37* C to 98] C ramp (denature); 

2) 3.0 min., 98 # C to 37* C ramp (anneal); 
w 3) add 1 unit Klenow fragment; and 

4) 2.0 min., 37* C maintain (extend). 

V. Dot Blot Hybridization 

75 For each DNA sample, four duplicate nylon filters were spotted with 5 ul of the 150 jxl amplified 
genomic DNA and one of probes GH66, 67, 68 or 75 was applied thereto as described in Example I except 
that DNA samples were neutralized before application to a nylon filter membrane, using a prehybridization 
solution of 6 x SSPE, 10 x Denhardt's solution and 0.5% SDS for one hour at 50* C and the same solution 
overnight at 50 *C. The filters were washed with 0.1 x SSPE, 0.1% SDS for 10-15 minutes at 37* C. The 

20 filters were treated as described in Example I to obtain an autoradiogram. 

VI. Discussion of Autoradiogram 

The autoradiogram of the dot blot shows that the four HLA-DQa allele-specific probes. GH66. GH67, 
25 GH68, and GH75 complementary to four HLA-DQa allelic variants, may be used to define nucleotide 
sequence polymorphisms on amplified DNA from both homozygous and heterozygous individuals. The 
pattern of reactivity of the probe GH75 corresponds to the serologically defined type DQw1. 

B. Amplification and Detection of HLA-DQtf Sequences 

30 

I. Preparation of Primers 

Oligonucleotides designated GH28 and GH29 complementary to opposite strands of the conserved 5' 
and 3' ends of the DQ-0 second exon were used as primers. The primers, having the following sequences, 
35 were prepared as described in Example I. 

5 1 -CTCGGATCCGCATGTGCTACTTCACCAACG-3' (GH28) 
5 9 -GAGCTGCAGGTAGTTGTGTCTGCACAC-3 1 (GH29 ) 

40 

II. Preparation of Probes 

Based on the analysis of HLA-DQjS sequences from diverse sources, which were grouped into allelic 
4s variants, the following probes from two variable regions of the DQ0 second exon encompassing each variant 
were synthesized and labeled as described in Example I. The two regions, segments A (GH69-71) and B 
(GH60-62), as well as a third variable region, are shown in Table IV. The entire scheme of primers, probes 
and HLA-DQ0 sequence is shown in Table V, where the amino acid abbreviations are shown in Table III 
above. 

50 
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TABLE IV 



s HLA-0Q8 (segment A): 



20 25 30 

61 yTh rGl u ArgVal ArgGl yVa 1 Th rArgHI s H eTy r 
GGGACGGAGCGCGTGCGGGGTGTGACCAGACACATCTAT DRl 



10 



-TCT T DR2.4 

-TCT A DR6 

T DR8 

-T DR4" 



-A TCT G AG DR3.7 



is DXB: A C G T- 

HLA-OQB (segment B): 

20 45 50 55 60 

Va 1 Gl y Va 1 Ty r ArgAl aVa 1 Th rPr oGl nGlyAr gProVa 1 A1 aGl uTy rTrp As n 

GTGGGGGTGTACCGGGCAGTGACGCCGCAGGGGCGGCCTGTTGCCGAGTACTGGAAC 0R1 

i C—G A 0R2 

G — — ; DR6 

as ; j G T C CC DR4 

A G T C AC DR4' 

T G T T— AC T OR8 

A— T G -T— T T CC DR3.7 



30 



D XB : — T A — T — A G CGA — T AGCA-C -AG— C 



HLA-DQS (segment C): 

65 70 75 

35 Gl uVa 1 LeuGl uGlyAl aArgAl aSerValAspArgVal 

GAAGTCCTGGAGGGGGCCCGGGCGTCGGTGGACAGGGTG DRl 

A — GA-T C DR2 

-A— A —GA-T C DR4.6 



— CA A— A — — CC DR8 

40 — CA A — AAA G DR3.7 

— CT — T CA— AG CG- 
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The probes are as follows, where * is the label and " indicates the best probes. 
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5 



5 4 CGGCAGGCGGCCCCAGCGG-3 1 

5 ' CGGCAGGCAGCCCCAGCAG- 3 ' 

5 1 C AAC AGGCCGCCCCTGCGG- 3 1 

5 ' GATGTGTCTGGTCACACCCCG- 3 ' 

5 * GATGC TTC TGC TCACAAGACG- 3 ' 

5 1 GATGTATCTGGTCACA AGAC G- 3 1 



(GH71) 



(GH60)** 
(GH61)** 
(GH62) 



(GH69)** 
(GH70) 



10 

II. Amplification and Dot Blot Hybridization 

Using the method generally described in Example IIIA, the probes were found to have reasonable 
specificity for the portions of the allele being detected in genomic DMA samples. 

75 

C. Amplification and Detection of HLA-DR0 Sequences 
I. Preparation of Primers 

20 Oligonucleotides designated GH46 and GH50 complementary to opposite strands of the conserved 5' 
and 3' ends of the DR0 second exon were used as primers. The primers, having the following sequences, 
were prepared as described in Example I. 



30 II. Preparation of Probes 

Based on the analysis of HLA-DR0 sequences from diverse sources, which were grouped into allelic 
variants, the following probes from two variable regions of the DR0 second exon encompassing each variant 
were synthesized and labeled as described in Example I. The two regions, segments A (GH56-59) and B 
35 (GH51), are shown in Table VI. 



25 



5 ' -CCGGATCCTTCGTGTCCCCACAGCACG- 3 ' (GH46) 
5 1 -CTCCCCAACCCCGTAGTTGTGTCTGCA-3* (GH50) 



40 



45 



50 



55 
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The probes are as follows, where * is the label. 
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5' ttgatcaggttccacactcg-3' 

5 ' c agacgtagagtactcc- 3 ' 

5* cagacttacgcagctcc-3 ' 

5' :agacttaagcagctcc-3' 

5' catgtttaacctgctcc-3' 



(GH51) 
(GH56) 
(GH57) 
(GH58) 
(GH59) 
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III. Amplification and Dot Blot Hybridization 

Using the method generally described in Example MA, the probes were found to have reasonable 
specificity for the portions of the allele being detected in genomic DNA samples. 

5 

IV. Analysis of HLA-DR0 Sequences Associated With IDDM 

Several HLA class II beta genes were isolated from clinical blood samples of diverse HLA-typed IDDM 
individuals (from University of Pittsburgh clinic and from cell lines from IDDM patients available from the 

w Human Genetic Mutant Cell Repository, Camden, NJ) and non-diabetic controls (homozygous typing cells) 
using cloning methods. In one such method, which is a standard method, human genomic DNA was isolated 
from the patient samples using essentially the method of Maniatis et aL, Molecular Cloning (1982), 280-281 
or prepared from the buffy coat fraction, which is composed primarily of peripheral blood lymphocytes, as 
described by Saiki et al., Biotechnology , 3:1008-1012 (1985). This DNA was then cloned as full genomic 

is libraries into bacteriphage vectors, as described in Maniatis, supra , pp. 269-294. Individual clones for the 
HLA-DR/3 genes were selected by hybridization to radioactive cDNA probes (Maniatis, pp. 309-328) and 
characterized by restriction mapping. See U.S. Patent No. 4,582,788 issued April 15, 1986. Individual clones 
from IDDM patients were assigned to DR-typed haplotypes by comparing the clone restriction map with the 
RFLP segregation pattern within the patient's family. Finally, small fragments of these clones representing 

20 the variable second exon were subcloned (Maniatis, pp. 390-402) into the M13mp10 cloning vector, which is 
publicly available from Boehringer-Mannheim. 

In an alternative procedure for cloning the genes, amplification of the relevant portion (the second exon) 
of the gene was carried out from a total of 1 microgram of each isolated human genomic DNA as described 
in Example I using primers QH46 and GH50, which have non-homologous sequences to act as 

25 linker/primers. 

The reaction mixtures were subjected to 28 cycles of amplification and then the mixtures were stored at 
-20* C. Then the following cloning procedure was used for the amplified products. 

The reaction mixture was sub-cloned into M13mp10 by first digesting in 50 ul of a buffer containing 50 
mM NaCI. 10 mM Tris*HCI, pH 7.8, 10 mM MgCfe. 20 units Pstl and 26 units Hindlll at 37* C for 90 
30 minutes. The reaction was stopped by freezing. The volume was adjusted to 110 ul with a buffer containing 
Tris'HCI and EDTA and loaded onto a 1 ml BioGel P-4 spin dialysis column. One fraction was collected 
and ethanol precipitated. 

The ethanol pellet was resuspended in 1 5 ul water and adjusted to 20 u\ volume containing 50 mM 
Tris*HCI, pH 7.8, 10 mM MgCl 2 . 0.5 mM ATP, 10 mM dithiothreitol, 0.5 HQ M13mp10 vector digested with 

35 Pstl and Hindlll and 400 units ligase. This mixture was incubated for three hours at 16* C. 

Ten microliters of ligation reaction mixture containing Molt 4 DNA was transformed into E. coli strain 
JM103 competent cells, which are publicly available from BRL in Bethesda, MD. The prcrcedureTfoHowed for 
preparing the transformed strain is described in Messing, J. (1981) Third Cleveland Symposium on 
Macromolecules:Recombinant DNA , ed. A. Walton, Elsevier, Amsterdam, 143-153" 

40 Eighteen of the alleles from these two cloning procedures were sequenced. In some of the sequences 
determined four areas of specific DNA and protein sequence were found to occur in various combinations 
and to be associated with IDDM. The DNA sequences seen in the genomes of IDDM patients produced an 
alteration in one to three amino acid residues of the DR0 protein. These four variable regions of the DRfi 
second exon are found in sequences obtained from many diabetic sources and are identified above. These 

45 regions can be used for synthesizing primers and probes used for detecting such sequences. These IDDM 
related sequences are identifiable as LR-S, -FL-, V-S, and l-DE, with LR-S and -PL- being found at amino 
acid residue positions 10 to 13 and 36 to 39 in the DR beta second exon. 

V. Primers and Amplification 

so 

Primers GH46 and GH50 described in Example CI may be employed to amplify DNA samples to be 
tested for IDDM. The amplification procedure of Example I or IIIA may be employed, using also 10 jxl 
DMSO. 

55 VI. Expected Synthesis of Probes 

Two of four labeled DNA probes, designated GH54 (V--S). 5' CCTGTCGCCGAGTCCTGG) and GH78 
(l-DE, 5 f GACATCCTGGAAGACGAGAGA) may be employed. These probes may be synthesized as 
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described for the primers and labeled as described above. 

VII. Expected Dot Blot Hybridizations 

5 Using the dot blot method generally described in Example I, under stringent conditions, the probes are 
expected to have reasonable specificity for the portions of the allele being detected in genomic DNA 
samples. 

D. Amplification and Detection of HLA-DPa and DPjS Sequences 

10 

The known DP/8 sequences, showing the type of polymorphism already known, are depicted in Figure 6 
of Trowsdale et ah. Immunological Reviews , No. 85 (1985), p 5-43, at page 16. Further polymorphisms may 
be identified. Primers for the conserved segments and probes to the variable segments of these genes can 
be designed similarly to what is described above. 
is The nucleotide sequence of DPal alleles obtained from cDNA clones showing the type of polymorphism 
already known are depicted in Figure 4 of Trowsdale et al., supra, at page 12. 

The detection and amplification of such sequences may be clinically useful in bone-marrow transplan- 
tations and in tissue typing. 

20 EXAMPLE IV 

Frozen Molt 4 cells (a T cell line homozygous for normal 0-globin from Human Genetic Mutant Cell 
Repository, Camden, NJ as GM2219C). SC1 cells (a EBV-transformed B cell line homozygous for the sickle 
eel! allele from ATCC, Rockville, MD as CRL8756) and GM2064 cells (control described above having no 0- 

25 globin or 5-globin sequence) were thawed and resuspended in phosphate buffered saline, such that 10 ul of 
cells containing cell numbers varying from 37 to 1200 for each type of cell line was obtained. Each cell line 
was mixed with 35 ul water and then overlaid with mineral oil. The resulting suspension was heated at 95* C 
for 10 minutes. Then a total of 55 ul of the reagents used to amplify the cell lines in Example I, including 
primers PC03 and PC04, was added. 

30 The amplification procedure of Example 1 can then be used followed by the dot blot procedure. This 
direct use of the cells eliminates isolating the genomic DNA from the cell line or clinical sample. 

EXAMPLE V 

35 The procedure of Example ! was used to amplify the genomic DNA from a known normal individual, a 
known sickle cell individual, and an individual with no 0-globin gene sequences (GM2064), except that the 
amplification was automated as described in Example IIIA. 

Three labeled DNA probes, designated RS31, RS32. and RS33, of the following sequences were 
prepared as follows: 



5 1 TCC TGAG6AG AAG TCTG-3' (R S3 1 ) 
5 ' CCTGAGGAGAAGTCT- 3 ' (R S32 ) 
5 ' C TGAGG AGAAGTC- 3 1 ( R S3 3 ) 

45 

where * indicates the label. These probes are 17, 15, and 13 bases long, respectively, and are complemen- 
tary to the normal 0-globin allele (0 A ). The probes were synthesized and labeled as described in Example I. 
Probes RS32, RS33, and RS18 were tested for specificity for non-amplified cloned normal and sickle- 

50 cell globin sequences using the procedure described in Example I, except that the hybridization tempera- 
ture was reduced from 55* C to below 32* C. At hybridization temperatures below 55* C the RS18 (19-mer) 
did not show specificity unless the salt concentration was reduced. The two shorter probes showed 
excellent specificity at the higher salt content. 

The probes RS31, RS32, RS33, and RS18 were tested against the amplified genomic DNA. Because of 

55 the conditions of temperature and salt concentration, the 19-mer showed no specificity at a hybridization 
temperature of below 32 *C. All of the shorter mers did show specificity. These results clearly demonstrate 
that the shorter probes can be selective, and that the conditions of selectivity were less extreme than those 
needed for the 19-mer. 
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The 17-mer globin probe RS31 worked optimally when hybridized below 32* C in 6 x SSPE (with 10 x 
Denhardt's and 0.1% SDS) and then washed In 0.1 x SSPE for 10 minutes at 42* C. 

The 15-mer globin probe RS32 worked optimally when hybridized below 32* C in 6 x SSPE (with 10 x 
Denhardt's and 0.1% SDS) and then washed in 0.1 x SSPE for 10 minutes at 32* C. 
5 The 13-mer globin probe RS33 worked optimally when hybridized below 32 *C in 6 x SSPE (with 10 x 
Denhardfs and 0.1% SDS) and then washed in 0.1 x SSPE for 10 minutes at 25* C. 

EXAMPLE VI 

70 I. Synthesis of the Primers 

Two primers identified below were synthesized by the method described in Example I to amplify a 
portion of the second exon of the ^-globin gene: 

15 5 1 -ATTTTCCCACCCTTAGGCTG-3 1 (RS40) 

5 1 -GCTCACTCAGTGTGGCAAAG- 3 1 (RS42) 

This primer pair defines a 198 base pair amplification product that includes the sites of three relatively 
20 common 0-thalassemia mutations-the codon 39 non sense mutation, the codon 41-42 frameshift deletion, 

and the codon 44 frameshift deletion. 

Because the /3-globin gene in the region o' codons 39 to 42 is exactly homologous to delta-globin, the 

primers were designed to be specific for and only amplify /3-globin. The RS40 primer spans the first intron- 

second exon junction where there are six base pair mismatches with 5-giobin. RS42 is positioned over 
25 codons 84 to 91 and also contains six mismatches with 5-globin. These mismatches are sufficient to prevent 

hybridization of the primers to the 5-globin gene. After 20 cycles of amplification, the overall efficiency of 

these primers was approximately 80% and corresponded to a 130,000-fold amplification. As expected, the 

amplification product contained no detectable 5-globin DNA. 

30 II. Isolation of Human Genomic DNA From Clinical Samples 

Five genomic clinical DNA samples of various ^-thalassemia genotypes were obtained from Drs. Alan 
Scott and Haig Kazazian (Johns Hopkins University, Baltimore, MD). These samples were JH1 (normal/39 
non), JH2 (39 non/39 non), JH3 (normal/41 deletion), JH4 (17 non/41 del), and JH5 (39 non/44 deletion). 

35 

III. Amplification Reaction 

One microgram portions of each DNA sample and of Molt 4 as control were diluted into a 100 u.1 
volume with 50 mM NaCI, 10 mM Tris'HCI (pH 7.6), 10 mM MgCfe. 1 U.M primer PC03, 1 uM primer PC04, 
40 10% DMSO (v/v), 1.5 mM dATP, 1.5 mM dCTP, 1.5 mM dGTP, and 1.5 mM dTTP and subjected to 25 
cycles of automated amplification as described in Example IIIA, adding 1 unit of Klenow fragment at each 
cycle. 

IV. Oligodeoxy ribonucleotide Probes 

45 

Four DNA probes, designed XX1 , XX2, XX3 and XX4 below, were provided by Drs. Scott and Kazazian 
at Johns Hopkins University: 



50 5' CCTTGGACCTAGAGGTTCT-3' (XXI) 

5 1 CCTTGGACCCAGAGGTTC T-3 ' (XX 2 ) 

5 1 GGTTCTTTGAGTCCTTTGG- 3 1 ( X X 3) 

5' 5GTT 6AGTCCTTTGGG6AT-3' (XX4) 

55 

where * indicates the label later attached. The XX1 and XX2 pair was used to detect the codon 39 non- 
sense mutation, with XX1 complementary to the normal allele and XX2 complementary to the non sense 
mutant. The other pair of probes was designed to test for the 41-42 frameshift deletion, with XX3 annealing 
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to the normal allele and XX4 to the deletion mutant Each of the probes was phosphorylated as described in 
Example I. 

V. Dot Blot Hybridizations 

5 

Four replicate dot blots were prepared, each spot containing one-eighteenth of the amplification product 
(56 ng of genomic DNA). Each filter was individually prehybridized in 8 ml 5 x SSPE, 5 x Denhardt's 
solution, 0.5% SDS for 15 minutes at 55* C. One-half pmole of each labeled probe (specific activities 
ranged from 1 .8 to 0.7 uCi/pmole) was added and the hybridization was continued for an additional 60 
70 minutes at the same temperature. The filters were washed twice at room temperature in 2 x sodium saline 
phosphate EDTA (SSPE), 0.1% SDS, for 5-10 minutes per wash, followed by a high-stringency wash in 5 x 
SSPE, 0.1% SDS for 10 minutes at 60 *C. Autoradiograms were developed after overnight and two-hour 
exposures with a single intensification screen. 

75 VI. Autoradiogram Results 

The results were consistent with the listed genotypes of each DNA sample. Each probe annealed only 
to those genomic sequences with which it was perfectly matched. 

20 EXAMPLE VII 

The method herein may also be applied for forensic uses, by amplifying a random polymorphic region, 
e.g., HLA or mitochondrial DNA, to detect, e.g., nucleic acids in any body samples such as, e.g., hair 
samples, semen and blood samples, and other samples containing DNA. The nucleic acid may be extracted 
25 from the sample by any means, and primers and probes are selected based on identifying characteristics or 
known characteristics of the nucleic acid being detected. 

EXAMPLE VIII 

30 Purification of a Polymerase From Therm us aquaticus 

Thermus aquaticus strain YT1, available without restriction from the American Type Culture Collection, 
12301 Parklawn Drive. Rockville, MD, as ATCC No. 25.104 was grown in Rasks in the following medium: 



Sodium Citrate 


1 mM 


Potassium Phosphate, pH 


7.9 5 mM 


Ammonium Chloride 


10 mM 


Magnesium Sulfate 


0.2 mM 


Calcium Chloride 


0.1 mM 


Sodium Chloride 


1 g/l 


Yeast Extract 


1 g/I 


Tryptone 


1 g/l 


Glucose 


2 g/l 


Ferrous Sulfate 


0.01 mM 



(The pH was adjusted to 8.0 prior to autoclaving.) 

A 10-liter fermentor was inoculated from a seed flask cultured overnight in the above medium at 70* C. 
A totaJ of 600 ml from the seed flask was used to inoculate 10 liters of the same medium. The pH was 
so controlled at 8.0 with ammonium hydroxide with the dissolved oxygen at 40%, with the temperature at 
70" C, and with the stirring rate at 400 rpm. 

After growth of the cells, they were purified using the protocol (with slight modification) of Kaledin et al. a 
supra, through the first five stages and using a different protocol for the sixth stage. All six steps were 
conducted at 4*C. The rate of fractionation on columns was 0.5 column volumes/hour and the volumes of 
55 gradients during elution were 10 column volumes. 

Briefly, the above culture of the T. aquaticus cells was harvested by centrifugation after nine hours of 
cultivation, in late log phase, at a cell density o' 1.4 g dry weight/I. Twenty grams of cells was resuspended 
in 80 ml of a buffer consisting of 50 mM Tris*HCI pH 7.5, 0.1 mM EDTA. Cells were lysed and the lysate 
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was centrifuged for two hours at 35,000 rpm in a Beckman Tl 45 rotor at 4*C. The supernatant was 
collected (fraction A) and the protein fraction precipitating between 45 and 75% saturation of ammonium 
sulfate was collected, dissolved in a buffer consisting of 0.2 M potassium phosphate buffer, pH 6.5, 10 mM 
2-mercaptoethanol, and 5% glycerine, and finally dialyzed against the same buffer to yield fraction B. 

5 Fraction B was applied to a 2.2 x 30-cm column of DEAE-celluIose, equilibrated with the above 
described buffer. The column was then washed with the same buffer and the fractions containing protein 
(determined by absorbance at 280 nm) were collected. The combined protein fraction was dialyzed against 
a second buffer, containing 0.01 M potassium phosphate buffer, pH 7.5, 10 mM 2-mercaptoethanol, and 5% 
glycerine, to yield fraction C. 

10 Fraction C was applied to a 2.6 x 21 -cm column of hydroxyapatite, equilibrated with a second buffer. 
The column was then washed and the enzyme was eluted with a linear gradient of 0.01-0.5 M potassium 
phosphate buffer, pH 7.5, containing 10 mM 2-mercaptoethanol and 5% glycerine. Fractions containing DNA 
polymerase activity (90-180 mM potassium phosphate) were combined, concentrated four-fold using an 
Amicon stirred cell and YM10 membrane, and dialyzed against the second buffer to yield fraction D. 

15 Fraction D was applied to a 1 .6 x 28-cm column of DEAE-celluIose, equilibrated with the second buffer. 
The column was washed and the polymerase was eluted with a linear gradient of 0.01-0.5 M potassium 
phosphate in the second buffer. The fractions were assayed for contaminating endonuclease(s) and 
exonuclease(s) by electrophoretically detecting the change in molecular weight of phage X DNA or 
supercoiled plasma DNA after incubation with an excess of DNA polymerase (for endonuclease) and after 

20 treatment with a restriction enzyme that cleaves the DNA into several fragments (for exonuclease). Only 
those DNA polymerase fractions (65-95 mM potassium phosphate) having minimal nuclease contamination 
were pooled. To the pool was added autoclaved gelatin in an amount of 250 jxg/ml, and dialysis was 
conducted against the second buffer to yield Fraction E. 

Fraction E was applied to a 9 ml phosphocellulose column and eluted with a 100 ml gradient (0.01-0.4 

25 M KCI gradient in 20 mM potassium phosphate buffer pH 7.5). The fractions were assayed for contaminat- 
ing endo/exonuclease(s) as described above as well as for polymerase activity (by the method of Kaledin et 
al.)* and then pooled. The pooled fractions were dialyzed against the second buffer, then concentrated by 
dialysis against 50% glycerine and the second buffer. 

The molecular weight of the polymerase was determined by SDS PAGE. Marker proteins (Bio-Rad low 

30 molecular weight standards) were phosphorylase B (92,500), bovine serum albumin (66,200), ovalbumin 
(45,000), carbonic anhydrase (31,000), soybean trypsin inhibitor (21,500), and lysozyme (14,400). 

Preliminary data suggest that the polymerase has a molecular weight of about 86,000-90,000 daltons, 
not 62,000-63,000 daltons reported in the literature (e.g., by Kaledin et al.). 

35 EXAMPLE IX 

It is expected that the procedure of Example I may be repeated using a biotinylated probe prepared as 
described in U.S. Patent Nos. 4,582,789 and 4,617,261. 

In summary, techniques wherein nucleic acids are amplified in a chain reaction in which primer 
40 extension products are produced which can subsequently act as templates, and the amplified samples are 
analyzed using sequence-specific probes provide several important advantages. Procedure is simplified 
because the amplified samples can be spotted on a filter membrane as a dot blot, thereby avoiding the 
restriction digestion, electrophoresis and gel manipulations otherwise required. It is a more specific 
procedure because the amplification greatly increases the ratio of specific target sequence to cross- 
45 hybridizing sequences. 

In addition, the process improves sensitivity by 10 3 -10 4 . An interpretable signal can be obtained with a 
1 ng sample after an overnight exposure. Finally, by increasing the amount of sample applied to the filter to 

0. 1 to 0.5 ng, it is possible that biotinylated oligonucleotide probes may be utilized. 

so Claims 

1. A group of oligonucleotide probes in which the individual probes are capable of hybridizing to specific 
gene sequence variations so as to permit detection of said variations when said probes are labelled and 
so-hybridized, whereby allelic variations and gene polymorphism in samples may be detected using 

55 said group of probes. 

2. A probe group as claimed in claim 1 wherein the individual probes are each 15 to 25 nucleotides in 
length. 
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3. A probe group as claimed in claim 1 or claim 2 wherein said gene sequence variations are regions of 
base pair variation in a gene which are clustered, and said probe group includes one probe for each 
allele. 

5 4. A probe group as claimed in claim 1 or claim 2 wherein said gene sequence variations are regions of 
base pair variation in a gene which are spread apart, and said probe group includes more than one 
probe, each encompassing an allelic variant, for each allele. 



10 



5. A probe group as claimed in any one of claims 1 to 4 for use in HLA typing. 

6. A probe group as claimed in any one of claims 1 to 4 for use in a diagnostic or forensic application. 

7. A probe group as claimed in claim 6 for use in detecting insulin-dependent diabetes. 

75 8. A probe group selected from probe groups wherein the individual probes exhibit complementarity to or 
comprise the following sequences:- 



20 



25 



30 



35 



40 



45 



(i) 
(ii) 

(iii) 



(iv) 

(v) 
(vi) 

(vii) 



5 
5 
5 

5 
5 
5 
5 

5 
5 
5 
5 
5 
5 

5 
5 
5 
5 
5 

5 
5 

5 

5 
5 

5 
5 
5 
5 



-CTCCTAAGGAGAAGTCTGC-3 ' 
- CTCCTGAGG AG AAGTCTGC - 3 ' 
-CTCCTGTGGAGAAGTCTGC-3 ' ; 

-TGTTTGCCTGTTCTCAGAC-3 ' 
-TTCCGCAGATTTAGAAGAT-3 ' 
-TTCCACAGACTTAGATTTG-3 ' 
-CTCAGGCCACCGCCAGGCA-3 ' ; 

-CGGCAGGCGGCCCCAGCGG-3 ' 
-CGGCAGGCAGCCCC AGC AG- 3 ' 
-CAACAGGCCGCCCCTGCGG-3 ' 
-GATGTGTCTGGTCACACCCCG- 3 ' 
- G ATG CTTCTGCTC AC AAG ACG - 3 ' 
-GATGTATCTGGTCACAAGACG-3 ' ; 

-CTGATCAGGTTCCACACTCG-3 ' 
-CAGACGTAGAGTACTCC-3 ' 
-CAGACTTACGCAGCTCC-3 ' 
-C AG ACTTAAGCAGCTCC- 3 ' 
-CATGTTTAACCTGCTCC-3 ' ; 

CCTGTCGCCGAGTCCTGG 
GACATCCTGGAAGACGAGAGA ; 

-TCCTGAGGAGAAGTCTG-3 ' 
- C CTG AGG AG AAGTCT - 3 ' 
-CTGAGGAGAAGTC-3 9 ; or 

-CCTTGGACCTAGAGGTTCT-3 ' 
-CCTTGGACCCAGAGGTTCT-3 ' 
-GGTTCTTTGAGTCCTTTGG-3 ' 
-GGTT GAGTCCTTTGGGGAT-3 ' 
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9. A probe group selected from any of the following:- 
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ft 



( i ) 5 ' -CTCCTAAGGAGAAGTCTGC- 3 ' 

5 ' -CTCCTGAGGAGAAGTCTGC-3 ' 
5 ' -CTCCTGTGGAGAAGTCTGC-3 ' ; 

( ii ) 5 ' -TGTTTGCCTGTTCTCAGAC-3 ' 

5 ' - TTCCGCAG ATTTAG AAGAT - 3 ' 
5 ' -TTCCACAGACTTAGATTTG-3 ' 
5 ' -CTCAGGCCACCGCCAGGCA-3 ' ; 

( iii ) 5 ' -CGGCAGGCGGCCCCAGCGG-3 ' 

5 9 -CGGCAGGCAGCCCCAGCAG-3 ' 
5 ' -CAACAGGCCGCCCCTGCGG-3 9 
5 ' -GATGTGTCTGGTCACACCCCG-3 ' 
5 ' -GATGCTTCTGCTCACAAGACG-3 ' 
5 ' -GATGTATCTGGTCACAAGACG-3 9 ; 



( iv) 5 ' -CTGATCAGGTTCCACACTCG-3 ' 

5 ' -CAGACGTAGAGTACTCC-3 9 
5 ' -CAGACTTACGCAGCTCC-3 ' 
5 ' -CAGACTTAAGCAGCTCC-3 ' 
5 ' -C ATGTTT AACCTGCTCC- 3 ' ; 

(v) 5 9 CCTGTCGCCGAGTCCTGG 

5 ' GACATCCTGGAAGACGAGAGA ; 

( vi ) 5 9 -TCCTGAGGAGAAGTCTG-3 ' 

5 9 -CCTGAGGAGAAGTCT-3 ' 
5 9 -CTGAGGAGAAGTC-3 9 ; or 

( vii ) 5 ' -CCTTGGACCTAGAGGTTCT-3 ' 
5 ' -CCTTGGACCCAGAGGTTCT-3 ' 
5 ' -GGTTCTTTGAGTCCTTTGG-3 ' 
5 ' -GGTT GAGTCCTTTGGGGAT-3 ' 



10. A probe group as claimed in any one of claims 1 to 9, which probe group is solid-bound. 

11. A method of providing a group of probes for detecting allelic variations or gene polymorphism in 
samples, comprising selecting as a group a number of oligonucleotide probes in which the individual 
probes are capable of hybridizing to specific gene sequence variations so as to permit detection of said 
variations when said probes are labelled and so-hybridized; optionally said selected oligonucleotides 
being a group as defined in any one of claims 2 to 10. 
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